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Abstract 

A staircase in this paper is the set of points in Z 2 below a given rational line 
in the plane that have Manhattan Distance less than 1 to the line. Staircases 
are closely related to Beatty and Sturmian sequences of rational numbers. 
Connecting the geometry and the number theoretic concepts, we obtain three 
equivalent characterizations of Sturmian sequences of rational numbers, as 
well as a new proof of Barvinok's Theorem in dimension two, a recursion 
formula for Dedekind-Carlitz polynomials and a partially new proof of White's 
characterization of empty lattice tetrahedra. Our main tool is a recursive 
description of staircases in the spirit of the Euclidean Algorithm. 

1 Introduction 

Motivated by the study of lattice points inside polytopes, in this paper we seek to 
understand the set of lattice points "close" to a rational line in the plane. To this 
end we define a staircase in the plane to be the set of lattice points in the half-plane 
below a rational line that have Manhattan Distance less than 1 to the line. We 
prove several properties of these point sets, most importantly we show that they 
have a recursive structure that is reminiscent of the Euclidean Algorithm. 
Not surprisingly, staircases are closely related to the Beatty and Sturmian sequences 
defined in number theory (see |Sto76[ [FMT781 IF5901 IO'B02j ). i.e. to sequences of 
the form (Lf n J ~ I — ^)J) ^ or a ' ^ ^ ^ wl ^ x S c d(a, b) = 1. We show several 
elementary properties of these sequences from a geometric point of view. To our 
knowledge such a geometric approach to these sequences is not available in the 
prior literature. Our observations lead to three characterizations of these sequences 
( Theorem 14. lj) . One of these is known (see [GLL781 IFra05| ) while the other two 
seem to be new. 

We conclude the paper by giving several applications of our findings. Firstly, we give 
a new proof of a theorem by Barvinok in dimension 2. Barvinok's Theorem states 
that the generating function of the lattice points inside a rational simplicial cone can 
be written as a short rational function. While Barvinok uses a signed decomposition 
of the cone into unimodular cones to achieve this result, we partition the cone into 
sets that have a short representation. 

Secondly, these ideas can also be used to give a recursion formula for Dedekind- 
Carlitz polynomials. These arc polynomials of the form Ylt=i xk ~ 1 U 

or, equiv- 
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alently, generating functions of the lattice points inside the open fundamental par- 
allelepipeds of cones of the form cone (( _?i ) ,("))• O ur recursion formula answers 
a question from [BHM08] . 

Finally, we simplify ideas from [Sc a85j and |Rez06j to give a partially new proof 
of White's Theorem, which characterizes three-dimensional lattice simplices that 
contain no lattice points except their vertices. 

This paper is organized as follows. In Section [2] we give definitions of and present 
elementary facts about the objects we study. While Section [2] is rather concise, 
we elaborate more in Section [31 where staircases are examined from a geometric 
point of view. Most importantly we explain the recursive structure of staircases in 
Lemma 13. 9[ 13.111 and 13.121 In Section [4] we motivate the three characterizations of 
Sturmian sequences before summarizing them in Theorem l4.ll Section [5] is devoted 
to the proof of this theorem. In Section [6] we apply Lemma f3. 121 to give a new proof 
of Barvinok's Theorem in dimension 2 and in Section [7] we use Lemma l3.11l to give 
a recursion formula for Dedekind-Carlitz sums. We conclude the paper in Section [8] 
by giving a partially new proof of White's Theorem. 

2 Staircases and Related Sequences 

In this section we give the basic definitions we are going to work with. In particular 
we introduce staircases, which are the main geometric objects we will analyze. Then 
we will define some related sequences of integers and state basic facts about them 
and their connection to staircases. We elaborate on the geometric point of view and 
give additional examples in Section [5J 

Before we introduce staircases, here are some preliminaries: For any real number 
r G R we define the integral part lr\ := max {z G Z | z < r} of r. The fractional 
part {r} of r is then defined by r = \ r\ + {r}. Given < a, b G N there exist unique 
integers (b div a) and (b mod a) such that b = (b div a) ■ a + (b mod a) and 
< b mod a < a. Using these two functions we can write I —J = b div a and 
{-} = b m ° d a . We are going to use these two notations interchangeably. 

Given A, B C R 2 and v G R 2 , we define A + B := {a + b \ a G A, b G B} and 
— A := {—a \ a e A} and we use the abbreviations A — B := A + (-B) and A + v = 
A + {v}. We will refer to A + B as the Minkowski sum of A and B. The difference 
of sets is denoted with A \ B := {a G A\ a g' B}. A lattice point is an element 
of Z 2 . An affinc (linear) lattice transformation of the plane is an affine (linear) 
automorphism of the plane that maps Z 2 bijectively onto Z 2 . A vector v G ~L d is 
primitive if gcd(i>i, . . . , Vd) = 1 or, equivalently, if T d n conv(0, v) = {0, v}. 

Now, what is a staircase? Let L be an oriented rational line in the plane. Then L 
defines a positive half-space H . The task is to describe the lattice points in H that 
are close to L in the sense that they have distance < 1 to the line in the Manhattan 
metric. Equivalently we consider those points x G Z 2 R H from which we can reach 
a point in the other half-space by a single horizontal or vertical step of unit length. 
Such a set of points we call a staircase. See Figure [T] for two examples. Note that 
it is sufficient to depict the staircase only under a primitive vector generating the 
line (in the first example the vector (3, 8)), as after that (and before that) the same 
pattern of points is repeated. 

We do not require L to pass through the origin (or any other element of Z 2 ), 
contrary to what Figure [1] might suggest. But we will see later that we can get 
all the information we want by looking only at lines through the origin. Also, 
without loss of generality we will restrict our attention to lines with positive slope, 
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as negative slopes will give us, up to mirror symmetry, the same sets. 




/(0,0) 

Figure 1: A part of the staircases 65,2 and Sag. Corners are shown as boxes. 



Let < a, b G N with gcd(a, b) = 1 and let r E R. These parameters define the 
line L a ,6,r = G R 2 | x 2 = ~ £1 + ?*}. We denote the closed half-spaces below 
and above that line by H7~ b and H~ b , respectively. Formally we define for any 
<j G {+1,-1} the half-space H° b r as 



,b 

< a\- X\ - x 2 + r) 
a 



Most of the time we will use + to represent +1 and — to represent —1 and write 
(7 G {+, — } for short. Also if a = + and/or r = we will omit these parameters 
and write H a j, for , and similarly for the symbols introduced below. The case 
a = + and r = is of the largest interest to us, as all other cases can be reduced 
to this one. 

We will give a precise formulation and proof of this in Lemma 13.21 Although 
the proof does not require additional tools and could already be given here, we 
will pursue the connection between certain sequences and our point sets first, and 
postpone all observations that are purely concerned with the point sets to Section[3] 
So let's start defining these sets properly, to make it more clear what we are talking 
about. 

The following definitions are illustrated in Figure [5] The lattice points in H° b 
that are at distance less than 1 from the line in vertical and horizontal direction, 
respectively, are 

V° btr = Z 2 n H: Ar \ (HZ Ar - ae 2 ) 

< a(-Zi - z 2 + r) < 1 



z G 

z 2 n K b ,, 

72 



a 

2 , 



z G 



,b N b 

< a(-zi - z 2 + r) < - 
a a 



Using this notation we now define the staircase S a 7 br to be the set of points that 
are at distance less than 1 from the line in horizontal or vertical direction, and we 
define the corners C° b r to the be the lattice points that are at distance less than 1 
in horizontal and vertical direction: 

^a.b.r ^a,b,r ^ '*'a 3 b,ri 

s~«J yj(T p. nja 

^a,b.v *a,b,r ' ' r ^a,b,r' 
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In other words 

Sl ih = {zeZ 2 \zeHl b butz-ae 1 <tHl b oTz + *e 2 gHZ <b } 
Cl b = {z £ Z 2 | z £ H a a h but z -aei & H a ab and z + ae 2 <£ H° b ) . 

See Figure [1] and also Figure [2J Clearly C a ,b C 5 a ,6. For any set A c Z 2 and any 
x £ Z we call the set col x (A) = {(x,y) £ A} a column of A and for y £ Z we call 
the set rowj,(A) = {(x,y) £ A} a row of A For any < a, o £ N, every row and 
every column of S a ,b contains at least one point and every row and every column 
of C a ,b contains at most one point. 

The sequence (\col x (S a ,b)\)x£Z is called the column sequence of S a ,b, the sequence 
(|rowj / (C a ,6)|) !/ ez is called the row sequence of C a ,b and so on. 

In the following we summarize some basic facts about staircases. We omit the proofs 
as they are easy enough to do and would slow the pace of this section without giving 
the reader further insights. The reader may find it instructive, however, to check 
the validity of these facts by looking at examples such as those given in the figures 
of this section. 

Fact 1. For all < a, b £ N and a £ {+, -} 

a>b & H; t CV;j & Va; : 1001,(5^)1 = 1 & : |row„(C£ 6 )| = 1, 
a<b K !b DVZ ib Vy:|row„(S* 6 )[ = l Vx : |col x (C Q CT )6 )| = 1. 

In the former case we call S° b flat and in the latter case we call S° b steep, see 
Figure H Note that this implies S% >b = V£ 5 for flat and S% b = W ab for steep 
staircases. 




Figure 2: This figure shows the flat staircase and the steep staircase S33 from 
Figured together with parts of the corresponding sets 7i a b and V a b . This illustrates 
Fact[U 



Fact 2. For all n £ Z the topmost point of col n (5 ai b) is (n, \_^n\ ). 

If S a , b is flat, we have seen in Fact [T] that for every n £ Z the set col n (5 a ,&) contains 
exactly one element, so all elements of S a:b have the form (n, n\ ). If S a b is steep, 
this is only true for the corners. 

This description allows us to compute the difference in height between the topmost 
points in consecutive columns of S a ,b- For all < a, 6 £ N wc define the sequence 
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B a .b = {B a , b (n))nez by 



B a ,b( n ) 



■(n-1) 



(2.1) 







H 









Fact 3. If a < 6 (i.e. 5 a ,b is steep) then 

|col„(S , 0)6 )| = B a ,b( n ) 
and if < a (i.e. S a ,b is fiat) then 

|col„(C 0jb )| = B a ,b(n), 
in particular B a j, is a 0, 1-sequence in this case. 

The sequence B a b is the key to connect the geometric description of "points close 
to a line" with notions from number-theory 

The sequences (|_|n_|)„ eN , (-B 0)6 (n))„ e N and CB a ^(n)) neN (see below) are known 
as the characteristic sequence, the Beatty sequence and the Sturmian sequence of -, 
respectively. 

These sequences are well studied in number theory, see |Sto761 IFMT781 IPS901 
IO'B02j for surveys, [Bro93j for historical remarks and |Fra05j for a discussion about 
the names of the sequences. However, only the characteristic sequences of irrational 
numbers are non-trivial from the point of view of number theory. They also appear 
in geometry, see Section [5J 

For the rest of the section we will establish some definitions connected with the 
above sequences, and some basic properties of B a ^, relating them to the staircases. 

In this spirit, instead of working with (|_-nJ)„ e H> (B a ,b{n)) n eM and CB a;fc (n))neN, 
we will deal with (|_£"-_|)™e z ' (B a ,b(n)) n £Z and CB Q){ ,(n))„ ez , respectively. This is 
due to the fact that we look at staircases of lines, not of rays. 

Now let's define what a Sturmian sequence is. A sequence ,s = (s n ) n€ z of integers 
s„ G Z is called balanced (at k) if s n G {fc, fc+ 1} for all n G Z. If s is balanced at k, 
we can define a 0, 1-sequence s = (s n ) n£ z, which we call the reduced sequence, by 



Note that if s = (c)„gz is constant, s is balanced at both c and c 
s is defined with respect to c, i.e. s is constant 0. 



1. In this case 



Lemma 2.1. B a j, is balanced at [^J . If ^ G Z, then B a ,b( n ) = ~ f or a ^ £ Z. 

Proof. By (|2.1|) we know that \B a , b (n) — £| < 1 and by definition B a> b G Z. If 
- G Z, then the fractional parts in (|2.ip are both 0, and thus the second statement 
is also true. □ 

So Sturmian sequences (B_ a b (n)) n( z^ are well-defined. Furthermore, we now know 
that only two different integers appear in B ahl and that B ab tells us in which 
positions the larger integer of the two appears. 

Given our geometric interpretation of B a o from Fact [3] this means that a steep 
staircase has columns of only two different lengths and the reduced sequence 
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(_B a h (n))„gN encodes which columns are long and which columns are short. We 
will return to the concept of reduction in Section [3] 

To make the connection between the sequence B a ^ and the point set S a f, more 
transparent, we introduce some more notation. 

We will transfer the idea that all we want to know about the staircase can be found 
under a primitive integer vector in the line L a ^ into the language of sequences. 
For any sequence s = (s n )nez we say that s is periodic with period a £ N if 
(s n + a )nez = (sn)nez- We say that a is the minimal period of s if there is no period 
a' G N of s with a' < a and write P(s) for the minimal period of s. By (|2.1[) . if 
gcd(a, b) = 1, then -B a is periodic with minimal period a. 

For a periodic sequence s we define period(s) = (s n )o< n <p(s)- If s is a periodic 
0, 1-sequence, we write l(s) for the number of ones in period(s). We will frequently 
represent s by the P(s)-tuple pcriod(s). 

As B a b describes the differences of the maximal heights in adjacent columns of 5a, 6, 
these differences, accumulated between and a — 1, must sum up to b. 

We summarize the above observations into 
Fact 4. If < a, b G N and gcd(a, b) = 1, then 

P(B a ,b) = a and ^ B Btb (n) = b. 

0<n<a 

In particular if a > b (and thus S aj t, is flat), then l(i? a ,b) = b. 

To be more flexible when talking about parts of staircases respectively Bcatty- 
scqucnccs, we define the following. Given a sequence s = (s r i)nez> a finite subse- 
quence of the form 

S \[x ,X!] : = (Sri)*o<r><*l f° r SOme ^0 < X\ £ Z 

will be called an interval. The number of elements si — xo + 1 of sl^^jj we 
will call the length of the interval and we will denote it by length (sl^ , X1 ])- If s 
is a 0, 1-scqucnce, we will denote the number of ones in an interval sj^ .,,^ by 
ones(s|[ X0>;El] ). 

In Fact 5] we summed over the interval B a ^\ [0,0-1] • But because of the periodicity 
of S a ^b and B a t we see that we could have used any interval of length a — 1. So for 
any fixed i € Z the sequence B a ^(n + i)) n ez also describes S a ,b- This gives rise to 
the following definition: 

We say that sequences s = (s T i)?iez and s' = (s' n ) n ez are identical up to shift if 
there exists an i £ TL with (s„+;)„gz = ( s n)nez- m symbols s = s' . Our goal in 
Section [4] will be to characterize Sturmian sequences up to shift. 

3 Geometric Observations 

In this section we develop some properties of staircases and their related sequences 
from a geometric point of view. The most important operation on staircases is for 
us the reduction, which we turn to in the latter half of this section. We start with 
some more elementary operations. 

Throughout this section let < a, b e N such that gcd(a, b) = 1 and let a G {+, — }. 
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Elementary Properties of Staircases. As we have already mentioned (and 
used) before, all staircases with a given slope, regardless whether it's the one above 
or below the line, are translates of each other. Hence they yield the same step 
sequence up to shift. Before we finally prove this, we state an elementary lemma. 



Lemma 3.1. Let r G 

for some k G Z. 



The line £ a ,fc.r contains a lattice point if and only if r = 



Proof. Without loss of generality we can assume — 1 < r < 0. For any point z G Z 2 
the vertical distance to the line L„ b is -zi — zo = - for some k G Z. So if r ^ - 
for any k G Z, then L abr cannot contain a lattice point. 

That r = - is sufficient for the existence of a lattice point follows directly from the 
extended Euclidean Algorithm. It can also be shown with this geometric argument: 
L a .b.r can contain at most one lattice point z with < z\ < a, for if there were 
two distinct lattice points with this property then gcd(a, b) ^ 1. On the other hand 
V at n ([0, a) x R) contains exactly a lattice points, one in each column. Only the 



lines L 



with < k < a 



1 can intersect V ab . So each of them has to contain 



at least one lattice point. 

Lemma 3.2. For every < a, b G N and r G R 

1- S a ,b.r = S a ,b + V and C a .b.r = C a h + V for some v G 



□ 



and 



S nh — S a , 



id c; 



G a ,b + v for some v G 



Proof. 1. By Lemma [331 if | < r < ^±1 then S a ^ r = S aJ) k and C a ^ r = C ah h. 
Hence we can assume without loss of generality r = -, so the line L ,6,r contains a 
lattice point v = (vi, V2) with vi = -v\ + r. Then 



z G H a b r - v ^=> Z2 + V2 < -(zi+vi) - 

a 

b 

?2 < ~Z\ z G H a b- 
a 



This implies the first claim. 

2. By Lemma [3TT1 there is no lattice point v' with 



V; 



2 G 



< -(-«! -z 2 ) < 



1 - v' 2 < 1, so 
a- 1 



<7 



Also by Lemma 13 - 1 1 there exists a lattice point v with ~v\ — V2 
this point v 



0-1 



and for 



V",, - « = 



z G 



Z G 



z G 



< -(-(si + w x ) - (z 2 + v 2 )) < — 
a a 



,6 a — 1 , a 
0< -(-zi-22 ) < - 



1 



a 



a - 1 6 

> -zi - 22 > 

a a 



Applying the first observation again, we obtain 



y- - v = V + h . 

a,b a,b 



A similar argument shows H~ b —v = H^ b for a suitable v. Now, because of FactrjJ 

' a ab and C° b = V^ fc , depending on whether 
}. Therefore the above calculations imply 13. 2T^1 □ 



OCT 

D a,b 



Kb and C a, b 



a > b or a < b, where a G {- 



K, b or S^ b 
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The previous operations translated the staircases by an integral vector. Now we 
will introduce some other useful operations. We denote the reflection at the main 
diagonal by \ and the reflection at the origin by X; i- e - we define \ (x, y) = 
(y,x) and ^(x,y) — (~x,~y). Note that both induce involutions, i.e. self-inverse 
bijective maps, on sets of lattice points; so we understand a set of lattice points if and 
only if we understand its reflection. The effect of these two reflections on staircases 
is illustrated with an example in Figures [3] and |4] and formalized in Lemmas 13.31 
and GOD 




Figure 3: The reflection at the main diagonal swaps numerator and denominator 
of a staircase and places the points on the opposite side of the line. Here we see 
\$3,& = S 83 . 




Figure 4: The reflection at the origin transforms a staircase below the line into a 
staircase above the line and vice versa. Here we see X^.s = S 38 . Notice how the 
column sequence is reversed! 



Lemma 3.3. \S^ b = S£ and \C£ t = C£. 

In other words, reflection at the main diagonal swaps numerator and denominator 
of the slope and places the points on the opposite side of the line. See Figure [3] 

Proof. We compute 

(x,y)€H° b & ay<a(-x) & ax > a{^y) 

a o 

& -((tx) < -<r(|y) & (y,x) e 

& \(x,y)EH b :°. 

This implies both 

z - vet $ Hl b \(z) + {-a)e 2 £ H b ° and 

z + ae 2 ^Hl ib & \(z)-(-a) ei #H£. 

All three equivalences taken together give \S% b = S b ° and \C f % = C b °. □ 
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Lemma 3.4. X^, b = S~° b and X^ b = CJ. 

Thus (\col n (Sl b )\) n = (\coL n (S-t)\) n and (\col n (C^ b )\) n = (\coL n (CJ)\)n- 

This means that reflection at the origin maps a staircase below the line to the 
staircase above the line and vice versa. This operation reverses the Beatty sequence 
of the staircase. See Figured] 

Proof. We compute 

(x,y)€H° b ay<a(-x) ^> -a(-y) < -a(-(-x)) 

a a 

& (-x,-y)eH-Z & X(x,y)eHJ. 

which implies both X S% b = S~ b and X G a a b = C~ b , like in the proof of LemmaEPl 
For the second claim of the lemma we observe (using what we have already shown) 
that 

(n,y) 6 (col„(S£ 6 )) « (n,y) e 5 a % & (-n,-y)ej?^ 

« U-i/)eSJ « (-n > -i,)e(col_ n (S-f)). 

As this gives us for any fixed n a bijection between the sets co\ n (S% b ) and 
col- n (S~ b ), their cardinality must be the same. The argument for C a J b is anal- 
ogous. □ 

Putting Lemmas 13.2.21 and 13.41 together, we immediately obtain the non-obvious 
statement that reversing a Beatty sequence yields the same sequence up to shift. 

Corollary 3.5. (\coln{S^ b )\) n = (|coL„(5 a %)|)„. 

Proof. (|col„(5- fc )|)„ = (|col„(S a 7) + v\) n = (|col„(5 a 7)|)„ = (|col_„(S£ 6 )|)„ □ 

Similarly, Lemma 13 . 2.11 implies that C a ,b and C a ,b.r have the same column sequence 
for any r. 

Recursive Description of Staircases. We now return to the operation called 
reduction, which we defined for balanced sequences in Section[2] First, let us observe 
the relation between Beatty and Sturmian sequences more closely. The following 
fundamental lemma tells us that, not surprisingly, Sturmian sequences are Beatty 
sequences with a > b and vice versa. 

Lemma 3.6. B_ a b = B a ^ b mo( j a - Conversely if s is a sequence balanced at k G N 
and s = B a<b , then s = B a , ak+b . 

Proof. By (|2.1[) we observe that for any k G Z such that both b and b + ka arc 
positive 

B a ,b( n ) + k = B a . b+ka (n). 

B a j, is balanced at [^J = b div a by Lemma 12.11 Note that by definition 
b mod a = b — (b div a) a. So 

B-a,b{ n ) = B a ,b{n) - b div a = B a ,b mod a (n). 

Conversely if s(n) = B a b {n) and s is balanced at k, then 

s(n) = s(n) + k = B a . b (n) + k = B a>b+ka (ri) 

□ 
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How can this relation be phrased in terms of the staircases S a ,b and S a ,b mod a? The 
following lemmas give an answer to this question. See 65,13 and 6*5,3 m Figure [5J 

Lemma 3.7. Let < a < b. The lattice transformation A = (, ^ gives 
a bisection between C a ,b and S a ,b mod a- 

The corners C a ,b of the staircase S a ,b are just the points of the smaller staircase 
S a .b mod a up to a lattice transform. Here "smaller" refers to both the number of 
lattice points in a given interval and the encoding length of the two parameters a 
and b. Note that the inverse of A is A^ 1 = ( <jiv a 1 ) • 

Proof. As S at b is steep, col„(C Q , 6 ) = {(n, and col„(5 a , b mod a) = 

{(n, [ bm ° da n\)} by Facts Q] and [2 But 

n \ _ f n \ a ( n 

l^n\ J ~ \(b div a)n + [ bm ° da nj J ~ \ [ bm ° da 7 

□ 

However, to obtain all points in S a ,b from the corners C a ^b we need to know which 
columns of 5 a ,b are long and which are short (in the case b > a). It turns out 
that the corners in long columns are precisely the corners C a ,£, mod a of the smaller 
staircase, again up to the lattice transformation A. 

Lemma 3.8. Let < a < b. Then coln(C a .b mod a) contains a point if and only if 
col n (S a ,b) is long. 

Proof. S a .b mod a is flat. So we know by Fact [3] that (n, [ b m ° d a n\ ) g C a .b mod a 
if and only if 1 = Ba.tmodaH = B_ a b( n )- But this just means that coL^S^f,) is 
long. □ 

Taking the two lemmas together, we can describe every staircase S a ,b in terms of the 
smaller corners and points in the smaller staircase S a ,b mod a- This result, together 
with our ability to swap the parameters a and b (by means of Lemmas 13.31 and 13 .4[) 
and the fact that the staircases S a ,i arc easy to describe, we obtain a recursive 
characterization of all staircases. 

Let us look at an example, which is shown in Figure O before we formulate the 
recursion formally in Lemma 13.91 We want to express 5*5,13 in terms of smaller 
staircases. 

We know that the topmost points in each column are the points in 6*5,13 and C5.13 
is just the image of S*5,3 under the lattice transformation A = (21)- Note that A 
keeps columns invariant. 

We also know that 6*5,13 has columns of lengths 2 and 3 and that the long columns 
are precisely those in which C*5,3 contains a point. So if we have an expression for 
65,3 and C*5,3, we can give an expression for 65,13 and 6*5,13. 

To continue this argument inductively, we need to swap the parameters a and b, 
but this we can achieve by reflecting the staircases at the origin and at the main 
diagonal. So we reduce the problem of describing 65,3 to the problem of describing 
63.5. We can now continue in this fashion, expressing 63,5 in terms of 63,2, in terms 
of 62,3, in terms of 62,1- 

At this point we have finally reached a staircase with integral slope. These staircases 
have the nice property that all columns and all rows are identical and hence they 
can be described by a simple expression: the Minkowski sum of the lattice points 
on a line with those in an interval. This entire process is illustrated in Figure [5] 
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Figure 5: This figure show the recursive process of expressing 55.13 in terms of 
smaller staircases, described in the text. In this figure, empty squares indicate the 
corners of long columns, filled squares corners of short columns. Note that the 
empty squares occur in S a ,b precisely in the columns, in which there is an element 

of C a ,b mod a - 



Lemma 3.9. Let < a, b <G N and A = 



1. If a < b and gcd(a,6) = 1, then 



1 
b div a 1 



Ca.b — AS a ^b mod a 

Sa.b = AS a ,b mod a + {( ) J ■ ■ ■ > ( — (b div a)+l ) } 
U AC a ,b mod a + { ( -(b div a) ) }■ 

2. Ca.b = X\C b M and S a . h = X\S b ,a 

3. 7/6=1, then 

Ca.b = {{f):keZ} 

S a ,b - {(^):fc£Z} + {(°),...,(V)} 

In Section[4]we use this recursive structure to develop a characterization of Sturmian 
sequences. In Section [5] we employ the recursion to obtain short rational functions 
that enumerate the lattice points inside lattice polytopes in the plane, and in Sec- 
tion [7] for a representation of Dedckind-Carlitz polynomials that is computable in 
polynomial time. 

Proof. 1. By Lemma 13.71 the first equation holds. Every column of S a> b contains 
a corner and every column contains at least (b div a) points. So AS a .b mod a + 
{( § ))••■>( —(6 div a)+i ) I contains all points in S a ,b except the bottom-most points 
of the long columns. By Lemma 13.81 the long columns are precisely those in which 
S a ,b mod a has a corner. So AC a ,b mod a + { ( _ 6 °iv a ) } is precisely the set of bottom- 
most points of the long columns of S a ,b- 

2- S a .b = XSa.b = X\ $b,a by Lemmas 1331 and l3~il and similarly for C a>b . 

3. If b = 1, then for all k, n E Z we have = k if and only if ka < n < 

(k + l)a- 1. Hence row fe (5 a , h ) = {( fe fe a ) , . . . and row fc (C a , fc ) = {( k £ )}. 

□ 
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Relation to the Euclidean Algorithm. This recursion is closely related to the 
Euclidean Algorithm, which takes as input two natural numbers ci,C2 6 N. In each 
step Cj+i = Cj_x mod c, is computed. This continues until we reach a j such that 
Cj+i = and Cj ^ 0. Then Cj = gcd(ci,C2). 

Now suppose we want to determine S b . a and C b ^ a for some b > a. We flip the two 
parameters and then reduce the staircase, i.e. we applv l3~9l 2 and !3.9H . This reduces 
the problem to computing S ajb mo d a and C a . b mod a- Again we flip and reduce, which 
reduces the problem to computing S b mod a>a mod (b mod a) and C b mod 0jQ mod (fc mod a) 
and we continue in this fashion. In other words, we put c\ = b, c 2 = a and 
Cj+i = Cj_i mod Ci and compute the staircases Sc i)Ci+1 and C CuCi+1 recursively, 
until we arrive at the case S Cj _ lt i and C Cj _ 1: i which we can solve directly by 13.91 3. 
That we arrive in this case eventually follows by the correctness of the Euclidean 
Algorithm and the assumption that gcd(a, 6) = l! Note also that this recursion 
terminates after few iterations. This is made precise in the following lemma. 

Lemma 3.10. Let a, b G N and let (c n )„giM denote the sequence defined by c\ = b, 
c 2 = a and Ci +2 = Cj mod c,_|_i. Then min{j G tKl | Cj+\ = 0} G O(loga). 

Proof. (c,),>2 is monotonously decreasing for all a, b & M, as by definition c^+2 = 
Ci mod Cj+i < Thus Ci div c^+i > 1 for i > 2 and so 

Cj = (cj div c i+ i)c l+1 + mod c m ) > c i+ i +c i+2 > 2c i+2 
v v ' v v ' 

>1 =C i + 2 >Ci + 2 

for i > 2. Hence c i+2 fe < 2 _fe Ci, and so if k > log 2 Cj, then Ci + 2k < 1- in particular 
the minimal j such that Cj+i = satisfies j < 21og 2 c 2 + 2 G O(loga). □ 



Recursive Description of Parallelepipeds. Instead of describing the infinite 
set of lattice points in an entire staircase, one might want to describe finite subsets 
thereof, for example the set of lattice points in only "one period" of the staircase. We 
now give a recursion for the set of lattice points in the fundamental parallelepipeds 
of the cones cone ((g) , ( J)) and cone ((£), (-i))- 
The cone generated by v\ , . . . , v n G R d is the set 



cone(«i, ...,v n ) 




< a; G R for all 1 < i < n 



A cone is rational if all the v% are rational and it is simplicial if the v% are lin- 
early independent. The fundamental parallelepiped Ilconefvi .,«„) of a simplicial 
cone cone(t>i, . . . , v n ) is defined as 



hlcone(i)i ,. . . ,v n ) • \ ^ 



CtiVi 



< a, < 1 for all 1 < i < 



Note that any rational cone cone(i>i ,v n ) C R m can be transformed unimodularly 
to a rational cone cone(erej, v[, . . . , v' n ) with a G {+, — } and 1 < j < m. So we don't 
restrict ourselves by only looking at cones containing e\ or — e 2 in the generators. 
With the above notation 

v a , b n[o,a)xR = n one(( _ 0i)a)) nz 2 
H a , h nRx[o,6) = n one(( i M a )} nz 2 , 

see Figure El This means that if a < b (and hence S a _ & = H ab ), the points z in 
the staircase S a ,b with < z 2 < b are just the lattice points in the fundamental 
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parallelepiped II r{i \ (a\\. The corners z € C a b with < Z\ < a are just the 

cone Uo )'\b l) 

lattice points in the fundamental parallelepiped II // \ /a\v 

COne \\-l)'\b)) 

To give an interpretation of our recursion in terms of fundamental parallelepipeds 
it is convenient to define the set II° one ^ i v ■, of lattice points (!) in the open 
fundamental parallelepiped of cone(t>i, . . . , v n ) as 



CXiVi 



< a,j < 1 



Note that if n = 2 and both v\ and v-i are primitive, then n° one( . )Ji * U {([j)} = 
Z 2 n n cone (. Ul I , 2 ). So it suffices to give a recursion for the sets of lattice points in 
open fundamental parallelepipeds. 
We are going to use the following abbreviations: 

:=n cone((i' 1 ),(?)) U U,b : = n c°one((i' 1 ),(«)) 

: = n cone((l),(£)) ^,a. b ■= Kone(( J ),( g )) 



In terms of open parallelepipeds, Lemma 13.91 can now be phrased as follows. An 
example illustrating the somewhat involved expression in l3.11l l is given in Figure[7] 



Lemma 3.11. Let aieN and A = 



1 

b div a 1 



1. If < a < b and gcd(a, b) = 1, then 

H°,a,6 ~ ^n°,o,6 rood a 
n^, a ,6 = ^n°,o,6 mod a + {( ))•••)( -(6 div a) + l )} 
U AI\°^ a b mod a + ( _b div a ) 

u (?) + {(_° 1 ),...,(_ ;)d iVQ )}- 

2. n^ , 6 =x\n| Aa + (?) andnj t0)b =x\n!, Ao + (g). 

5. IC, lOll =0 and IIJ i0il ={(5),... ,(V)>- 
This allows us describe 11° a b in terms of W i a b „ lod a and n^ 0;6 mod a . The proof 
is similar to the one of Lemma 13.91 and we omit it for brevity. 
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Figure 7: This figure illustrates the formula given in 13.111 1. Ill 5 13 is expressed in 
terms on II? 5 3 (shown) and III, 5 3 (the corners of II? 5 3 ). The idea is the same 
as in Figure [5] and Lemma 13.91 However there is one important difference: Both 
col5(IIJ 5 3 ) and 0015(111, 5 3 ) are empty. These have to be added using the third 
tcrm( 1 V) + {(_° 1 ),(_° 2 )}. 



Recursive Description of Triangles. We conclude this section by giving a 
similar recursion for triangles, see Figure [H We write 

A a , fc :=conv{(8),(8),(g)} 

and 

A' Qjb :=A a , fe \conv{(° ),(£)} 
to denote closed and half-open triangles, respectively. The corresponding lattice 
point sets are denoted by T af> := A a . b n Z 2 and T' a b := A' ab n Z 2 . 

The idea is now that for < a < b the triangle A a _ b can be decomposed into two 
parts A^ ( b div a ) a and AA a ^ mod a- The former is defined by a line with integral 
slope and hence the set of lattice points T' a , b div a ^ a is easy to describe. The latter 
can be transformed into Ab mod a,a and we can obtain a description of the lattice 
point set T b mod a.a recursively. See Figure [H The resulting recursion is given in 
Lemma 13.121 without proof. 



Lemma 3.12. Let a, b G N and A 

2. T a ,b=X\T b .a + ( a b )- 

3. To,i = {(*) | 0<fc<a}U {(?)}. 
4- Ifk e N, then 



1 

b div a 1 



T'a.ka — [J {(o) ' ■ ' ' ' (ifc-l)} 
0<l<a 

= {(°)|meN} + {(8),(5),...,(g)} 

\({(°)|meN} + {(0),a),...,(^)}). 
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Figure 8: Similarly to our recursive description of S5 ; i3 (see Figure[5|), we can apply 
Lemma 13.121 recursively to partition As^ into triangles with integral slope. The 
different shadings indicate which triangle the different regions correspond to. 



The advantage of using the second expression for T' a ka in 13. 121 4 will become clear 
in Section [5] where we use it to obtain a short rational function representing the 
generating function of the set of lattice points inside T' a ka . Note that we obtain 
a recursion formula for T' by replacing every occurrence of T in 13. 121 1 and 13.121 2 
wit h T' and replacing [332 3 with T' al = {(§)|l<fe< a}. 

In |KNA94| Kanamaru et al. use a recursive procedure as in Lemma 13.121 to give 
an algorithm to enumerate the set of lattice points on a line segment. They go 
on to give an algorithm that enumerates lattice points inside triangles using the 
transformation A, however in this case they do not apply recursion and do not 
mention the partition given in Lemma l3.12U . This partition however is observed by 
Balza-Gomez et al. in [BGMM99 . But as they are interested in giving an algorithm 
for computing the convex hull of lattice points strictly below a line segment, they do 
not work with the full set of lattice points T a j,. In both cases no explicit recursion 
formula such as Lemma 13.121 is given. 



4 Characterizations of Sturmian Sequences 

In this section we state several characterizations of Sturmian sequences of rational 
numbers, i.e. sequences of the form B Q b (or equivalently B a b with < b < a). Wc 
will first motivate each characterization in a separate paragraph without proofs and 
then summarize them in Theorem 14. II The proof of the theorem occupies Section[5] 



Recursive Structure. The most important characterization of Sturmian se- 
quences for our purposes is a recursive one. It is based on the concept of reduction 
presented in Section [3] that relates B a b to i? Q b mod a in a way reminiscent of the 
Euclidean Algorithm. 

We first present the idea informally. Let < a < b and consider the sequence Bb. a 
and the related staircase Sb, a - An interval of Bb, a of the form 10 . . .0 of length k 
corresponds to a corner c £ Sb, a and k — 1 points in Sb, a at the same height as c. Wc 
call a maximal interval of the form 10 ... a block. A block of Bb. a corresponds to a 
row of Sb, a - If the block has length k, the row contains k points. The block sequence 
m{Bb,a) of Bb, a is the sequence of block lengths of Bb, a - By the above observation 
the block sequence of Bb jQ is the row sequence of Sb, a an d by Lemma 13.91 2 and 
Corollary 13. 51 the column sequence of X\>?6.a = S a _b up to shift. This means that 
m (Bb, a ) = B a b is Beatty and hence m(Bb ia ) = B a b mod a is Sturmian. Sec Figure[5] 
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for an example. It turns out that this gives a recursive characterization of Sturmian 
sequences: a sequence s is Sturmian iff m(s) is balanced and m(s) is Sturmian. 




100101001010 3 2322 32322 10100 



Figure 9: Recursive structure of Sturmian sequences. This figure shows several 
staircases S a ,b and the corresponding parts of the associated Bcatty sequence B a b- 
In each picture the first and last column correspond to the same element of B a j, 
modulo the minimal period. The first picture shows S'12,5 and i?i2.5- -B12.5 records 
which columns contain corners, so the block sequence m(-Bi2,s) is just the row 
sequence of 5*12.5. Reflecting at the main diagonal turns rows into columns, so 
m{B\2,5) is the column sequence of \ 512,5 = 5tJ~ 12 , which is shown in the second 
picture. Note that in the first picture the part of the staircase that we show was 
chosen such that points from the first and the last column lie on the defining line 
- and this property is preserved under reflection at the main diagonal. Applying 
the reflection at the origin gets us to X\5i2,5 = 5512, which is shown in the third 
picture along with its column sequence -65,12- The reflection at the origin, however, 
reverses the column sequence. Fortunately Corollary 13.51 tells us that Sturmian 
sequences are invariant under reversals - up to a shift. This shift in the column 
sequence can be seen from the fact that in the third picture, the defining line does 
not pass through points of the first or last column anymore. From the first three 
pictures, we see that the column sequence of 55,12 is just the row sequence of 5i2,5 
up to shift, i.e. m{B\2^) = i?5,i2- Now we can apply reduction. We pass from 55,12 
to 55,2, as shown in the last picture, which gives us m(i?ia s) = #5,2- This is a 
geometric illustration of the combinatorial fact that if s is Sturmian, so is m(s). 

To make this precise we give the following definitions. Suppose s = (s n ) ne z is a 
periodic 0,1-sequence with P(s) = a and l(s) = b. Without loss of generality we 
can assume that sq = 1. Let i\, . . . , % 6 {0, . . . , a — 1} be the indices of the Is, 
i.e. let i\ < ii < ■ ■ ■ < % with = 1 for all 1 < j < b. Put ib+i ■= a. Then 
the j-th block in (so, . . . ,s a -i) is s\u and the length of the j-th block is 

rrij := ij+i —ij- The block sequence m(s) is the infinite periodic sequence generated 
by (mi, . . . , mb). Note that this definition determines m(s) only up to shift, which 
suffices for our purposes. On equivalence classes of sequences up to shift, m is an 
injective function, i.e. Si = S2 to(si) = m{s2). We call a sequence block balanced 
if it is balanced and its block sequence is balanced. In this case we can consider 
the reduced block sequence m(s) which is again a 0, 1-sequence. A sequence s is 
recursively balanced 

• if l(s) = 1, or 

• if s is block balanced and m(s) is recursively balanced. 
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The characterization now is this: 



A periodic 0,1-sequence is Sturmian if and only if it is recursively balanced. 

Even Distribution of Os and Is. Common sense suggests that, as the staircase 
approximates a line, the Os and Is of the Sturmian sequence should be distributed 
as evenly as possible. The actual number of Is in every interval should be as close 
as possible to the expected number of Is. This can be made precise in the following 
way. On an interval of length I, a line with slope ^ increases by -I. So the expected 
number of Is in an interval of length I of £ a b is H, if b < a, and b ™ da l in general. 
As b m ° d a I is in general not an integer, the best that can be hoped for is that for 
every interval I of length I the number of Is contained in I is either I b m ° d a l\ or 
[~ - m ° d a f\ , and indeed this is a necessary and sufficient characterization of Sturmian 
sequences. Formally, we say that the Is in a periodic 0,1-sequence s are evenly 
distributed if for every interval sl^^] 



(4.1) ones(s|r X0)Xl] ) G 



l(s). 

— Iength(«| [a[0)ai] 



jj^lengthW [80i!Bl] ) 



Note that if z G Z and r G R, then z G { [r\ , |>] } if and only if z — 1 < r < 2 + 1, 
so the condition 

(4.2) ones(s| [x0iXl] ) - 1 < ^length^^) < ones(s| [a;0::ri] ) + 1 

is equivalent to (|4.ip . If an interval s|lj. >Xl ] violates the left-hand inequality, then 
we say it contains too many Is and if it violates the right-hand inequality, we say 
it contains too few Is. The characterization, then, is this. 

A periodic 0, 1-sequence s is Sturmian if and only if the Is in s are evenly 

distributed. 

This characterization appears in |GLL78j and was later improved in |Fra05j . 

Symmetry. A different way to phrase that the Os and Is are distributed evenly 
would be to state that Sturmian sequences are symmetric. If symmetric is taken to 
mean invariant under reversals (up to shift), then this is a true statement (Corol- 
lary [33J - but insufficient to characterize Sturmian sequences. 
However, Lemma 13.21 suggests a different notion of symmetry. If we start with a 
flat staircase S a ,b with < b < a and move the defining line downwards by a small 
amount, the resulting staircase S a ,b,r will be a translate of S a ,b- Hence their column 
sequences are identical up to shift. But using Lemma l3"Tl we see that if > r > — ^ 
the only columns that differ are coh a for k G Z: the single point in these columns 
has been moved down by one. The columns coU a do not contain a corner anymore, 
whereas the columns colfeo+i do. In the Sturmian sequence, this translates to taking 
the corresponding interval 1,0 and replacing it with the interval 0, 1. 

This observation gives rise to the notion of swap symmetry. A periodic 0, 1-sequence 
s is swap symmetric if there is a pair (sj, s;+i) = (1,0) such that if we replace this 
pair and all periodic copies of it by (0, 1), we obtain a sequence s' that is identical to 
s up to a shift. Sec Figure [TU] for an example. Formally, given a periodic sequence 
s and i G Z, we define the sequence swap(s,i) := (swap(s, i) n ) n by 



swap(s, i) r 



s n — 1 if n = i mod P(s) 
s n + 1 if n = i + 1 mod P(s) 
s„ otherwise 
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0, 1, 0, 1, 1, 0, 1, 0, 1, 1, 1, 1, 0, 1, 0, 1, 1, 0, 1, 0. 

^ J V J 

Y Y 



Figure 10: Swap symmetry of Stumian sequences. Consider the staircase £5,3 and 
the corresponding Sturmian sequence #5,3, which records the columns of 65,3 that 
contain a corner. If we move the line defining downwards by a small amount, 
then only those columns change in which there was a point on the line £5,3. These 
columns do not contain corners any more - the corners move one column to the 
right. So in the sequence B^^ this corresponds to replacing an interval 10 with 
01, i.e. by swapping a 1 and a 0. The digits that are swapped arc highlighted in 
the figure. Now, by Lemma [3T21 translating the defining line only shifts the column 
sequence of C5 3. So the sequence obtained from £5,3 by swapping is again B 5 3 
up to shift. This is shown by the braces, which indicate minimal periods of both 
sequences. 



We call a periodic sequence s swap symmetric if there exists an i £ Z such that 
s = swap(s,i). Note that if s = swap(s,z) then 

s, = swap(s, i) i+ i = s i+ i + 1 
Si+i = swap(s, i)i = Si-1 

as the number of entries < k < P(s) such that Sk = c cannot change under a swap 
for any constant c G N, for swap symmetric s. 
This property characterizes Sturmian sequences: 

A periodic 0,1-sequence is Sturmian if and only if it is swap-symmetric. 



Having motivated the three characterizations, we can now state the theorem. 

Theorem 4.1. Let s ~ (s n )nez be a periodic 0,1-sequence with P(s) = c 
l(s) = b > 1. Then the following are equivalent: 

(ii) s is recursively balanced. 
(Hi) The Is in s are evenly distributed, 
(iv) s is swap symmetric. 



(i) =>• (Hi) is easy to prove (see next section) and ^j^j =^ \ appe ars in |GLL78j 



although the concept of "nearly linear" sequences used in |GLL78] differs slightly 
from l4.lK ro). The connection between these definitions^ is made in |Fra05j . where 
the result from [GLL78] is extended. In both cases the focus lies on the more general 

1 In Fra05 sequences with an even distribution of Is are called "balanced". 
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case of lines with irrational slope. The proofs given in these two sources differ from 
the proofs we present in Section [5] 

As far as we know the concepts of recursively balanced and swap symmetric se- 
quences do not appear in the prior literature. 



5 Proof of the Characterizations 

We first show the recursive characterization, i.e. that a periodic 0,1-sequencc is 
Sturmian if and only if it is recursively balanced. To that end we first prove two 
lemmas. 

Lemma 5.1. IfO<a<b, then m(B b:a ) = Ba t- 
Proof. Let < a < b. A maximal interval of B ba of the form 1, 0, . . . , corresponds 
to a row of >f?6 )0 , so 

m(B bia ) = (|row„(S' fc , a )|)„ = (|col„ (£"(,)!)„ = (|col„(5 a , fc )|)„ = B a<b 
where we use Lemma 13.31 in the second and Lemma 13.21 in the third step. □ 



From this we also get that for a > b (i.e. flat staircases) the sequence m(B ab ) is 
balanced, and thus Sturmian sequences are block balanced. 

Lemma 5.2. If s is a block balanced 0, 1-sequence, then s is Sturmian if and only 
if m(s) is Sturmian. 

Proof. As s is a block balanced 0, 1-sequence 

s = B 0ja for some < a < b 
m(s) = B a o for some < a < b 
<^> rn(s) = B Q b for some < a < b 

where the first equivalence holds by Lemma lS.ll and the fact that m is injective and 
the second equivalence holds by Lemma 13.61 □ 



Proof of Theorem \4-l\\(i)\ (ii) The proof is by induction on l(s). If l(s) = 1, 



the statement holds. For the induction step, we have the following equivalences: 

s is recursively balanced 
<^> ,s is block balanced and 

m(s) is recursively balanced 
<^ s is block balanced and 

m(s) is Sturmian 
<^> ,s is block balanced and 

s is Sturmian 

s is Sturmian 

Here we use Lemma 15.21 in the third step. Note that the induction terminates in 
the case l(s) = 1 since if l(s) > 1, then s has blocks of different sizes and so 
l(s) > l(m(s)) > 1. □ 

Now we turn to the proof of the characterization that a periodic 0, 1-sequence s is 
Sturmian if and only if the Is in s are evenly distributed. One direction is easy to 
show. 
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Proof of Theorem \4- l\\(Tj\ (Hi) Let B a ^ be a Sturmian sequence (i.e. a > b) and 



let B a ^\[ Xo Xl ] be any interval of i? a ,6- Using (|2.ip and the fact that 
^(xi - ar + 1) = ^7^ylength(B Oj6 | [a . 0>a!l] ) 



we obtain 



(5.1) ones(-B ,(,| [x0iXl] ) = ^^ length(-B a , b | [g0 , Xl] ) + fyxo - 1)| - j^xi 

So, since < {^(^o — 1)} < 1 an d < < 1 and both terms appear in ([5 

with opposite signs, 

ones(B O)6 | [a . 0ia!I ]) - 1 < ^|^|length(B ai6 | [a . 0iXl] ) < ones(S aj6 |[ XOia!1 ]) + 1. 



□ 



To show the other direction, we make use of the first characterization. We show 
that if the Is in s are evenly distributed, then s is recursively balanced. 



Proof of Theorem \4- => (ii) Let s be a periodic 0,1-sequence in which the Is 
are evenly distributed. We use induction on l(s). If l(s) = 1, then by definition s 
is recursively balanced. For the induction step, we assume l(s) > 1 and show that s 
is block balanced and the Is in m(s) are evenly distributed. Then we can apply the 
induction hypothesis to obtain that m(s) and hence s is recursively balanced. 

Step 1: s is block balanced. If there were blocks of zeros in s that differed in length 
by at least two, then we could find intervals u and v of the same length I, such 
that u contains two Is and v contains none. But then {0,2} C { -p^y?J , p^y^ }j 
which is impossible. So s is block balanced at some k G N, m(s) is well defined and 
the following identities hold. 

P(m(s)) = l(s) 
l(m(s)) = P(s) - kt(s) 

Step 2: The Is in m(s) are evenly distributed. Briefly, the idea is this: if m' is an 
interval of m(s) =: m that has too many Is, looking at the corresponding interval in 
s we will find many large blocks (i.e. many 0s) and so we can construct an interval s" 
of s that has too few Is. This gives a contradiction to the assumption that the Is 
in s are evenly distributed. 

Let m' = m(s) | u. 0/ri i be an interval of m(s). We have to show that 

1(to) 

(5.2) ones(m ) - 1 < ^^ylength(m') < ones(m') + 1. 



Assume to the contrary that m! violates ([52 
We first argue that without loss of generality 

(5.3) ones(m') - 1 > lM| en gth(m'), 

P(m) 

i.e. that m! contains too many Is. Suppose ml contains too few Is, i.e. ones(m') + l < 
length (to'). Then choose X2 > x\ such that the length of the interval rn\^ XQX ^ is 
a multiple aP(m), a £ N of the period length. Then ones(m|[ xo iX2 i) = al(m) as m 
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is periodic. Now the interval m|[ a;i _|_ 1 . E2 ] has too many Is, i.e. ones(m|[ a . 1+1;r2 ]) — 1 > 
length (m|[ a , 1+ i |iE2 ]), as witnessed by the following computation: 



ones(m|[a, 1+ i ja;il ]) = ones(m|[ X0ia;2 ]) - ones(m') = al(m) - ones(m') 
at(m] 
l(m) 



„ . l(m) . . , is 
> at(m) - -7— ^length(m') + 1 
P(m) 



P(m) 
l(m) 
P(m) 



(aP(m) - length (m')) - 
length(m|[a; 1+ i )X: ,]) + 1. 



Each element of m' corresponds to a block of Os in s, where we take the block 
to include the preceding 1 but not the succeeding 1. Taking all the blocks in s 
together that correspond to elements of m' we obtain an interval s' = s|[ y0jyi ] of s. 
Let s" = slfj/o+ijj/!] denote the interval obtained from s' by removing the first 1. 
Then the following identities hold. 

length(m') = ones(s') = ones(s") + 1 
ones(m') = length(s') - k ones(s') = length(s") + 1 - fc(ones(s") + 1) 

By substituting these and the identities obtained in Step 1 into (|5.3[) we obtain 

lengthfs") + 1 - fe(ones(s") + 1) > ~ kt ( s ) ( ones (,s") + 1) + 1 

l(s) 

which by canceling terms implies length (s") > 0|f(ones(s") + 1) and therefore 

ones(s") + 1 < ■p^jlength(s"). This means that s" is an interval in s with too 
few Is, contradicting the assumption that the Is in s are evenly distributed. □ 

Finally, we turn to the characterization that a periodic 0,1-sequence is Sturmian if 
and only if it is swap symmetric. In Section 2] we have already tried to motivate 
that Sturmian sequences are swap symmetric, and the proof indeed proceeds as 
suggested by Figure [POl 



Proof of Theorem \4-l\\(i)\ (iv), Let < b < a and let s = B a ^ = (|col n (C Q ,(,)|)„. 
We claim that 

swap(s,0) = (|co] n (C tt)6) _i)|) ri = (|col„(C 0)6 )|)„ = s 

which completes the proof. The equivalence in the second step holds by Lemma |3. II 
All that is left to show is why the first equality holds. 

To this end we argue as follows (see Figure [TU]): First we observe that shifting the 
line down by — - only changes those columns col x (S a! b) with x mod a = 0. More 
precisely col x (S a b _±) = col x (S a ,b) - (?) if = a; mod a and coLj(5 0j6j _i ) = 
coL(S' a ,b) otherwise. Now we observe that a point v in a flat staircase is a corner 
if and only if v — e\ is not in the staircase. As we know which columns changed, 
and that a > 2, this allows us determine where the corners are after the shift. If 
x mod a = 0, then |col x (C ajhi _i)| =0. If 2; mod a = 1, then |coLj(C a b _ 1 )| = 1. 
Otherwise co\ x (C a b _±.) = co\ x (C a ^ b ). □ 

To show that swap symmetric sequences are Sturmian, we first prove two lemmas. 
Note that in Lemma 15.31 we do not claim that if s is swap symmetric, then m(s) is 
balanced. However we can show that m(s) is swap symmetric^ We then observe 



2 Our definition of swap symmetry was phrased such that it can be applied to arbitrary periodic 
sequences. 
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in Lemma 15.41 that swap symmetric sequences are necessarily balanced and hence 
m(s) is balanced. 

Lemma 5.3. Let s be a periodic 0, l-sequence. If s is swap- symmetric and l(s) > 1, 
then m(s) is swap- symmetric. 

Proof. Wc know that P(m(s)) = l(s) > 1. Let < i < P(s) be such that 
swap(s, i) = s. Then Sj = 1. Say the block preceding Si is the j-th block of s. Swap- 
ping at i makes all the fc-th blocks of s with k = j mod l(s) larger by one and all 
the k-th blocks of s with k = j + 1 mod l(s) smaller by one while leaving all other 
blocks unmodified. As P(m(s)) > 1, this means that swap (to (swap (s, i)), j) = m(s) 
but by assumption swap(s,z) = s, so there exists an / £ Z such that 

swap(m(s), Z) = swap(m(swap(s, i)), j) = to(s) 

which means that m(s) is swap symmetric. □ 

Lemma 5.4. 7/s is a periodic swap- symmetric sequence, then s is balanced. 

Proof. Assume to the contrary that s contains at least three different entries. Let 
< i < P(s) be such that swap(,s, i) = s. Then Si ^ Sj+i- Let a = s, and b = Sj+x- 
Let c <E Z be such that there exists a j e Z with Sj = c but a ^ c ^ b. We now 
define the parameter d(s), which is the sum of the distances of any occurrence of a 
in period(s) to the closest preceding occurrence of c in s, i.e. 

d(s) := k — max {I < k \ si = c} . 

0<fc<P(s),s fc =a 

Note that if s = s', then d(s) = d(s') as shifting a sequence to the left or right 
does not affect the distances between occurrences of values. Now the swap at i 
interchanges the a at position i and the b at position i + 1, which increases the 
distance of this occurrence of a to the previous c by 1 and leaves all other distances 
of an occurrence of a to a previous c unaffected. Hence c?(swap(s, i)) = d(s) + 1 and 
so swap(s,i) ^ s, which is a contradiction. □ 

After these two lemmas, the proof that swap symmetric sequences are Sturmian is 
easy. Again we proceed by showing that swap symmetric sequences are recursively 
balanced. 



Proof of Theorem \4-l\ (iv) (ii) Let s be a periodic 0, l-sequence that is swap 
symmetric. There is an index i at which we can swap, so l(s) > 0. If l(.s) = 1, s 
is recursively balanced by definition. So we can assume l(s) > 1. By Lemma 15.31 
it follows that m(s) is swap symmetric. By Lemma 15.41 it follows that m(s) is 
balanced. Taking both together we conclude that s is block balanced and that to(s) 
is well defined and swap symmetric. By induction we infer that to( s ) is recursively 
balanced. But if s is block balanced and to(s) is recursively balanced, then s is 
recursively balanced. □ 



6 Application: Short Representations 

It is a celebrated result by Barvinok |Bar94] that there is a polynomial time al- 
gorithm for counting the number of lattice points inside a given rational polytopc 
when the dimension of the polytope is fixed. Note that if the dimension is an in- 
put variable, the problem gets A^P-hard |GJ79j . For more about the algorithm see 
}DL05j . |DLHTY04] and the textbook |Bar08j . 
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The crucial ingredient of Barvinok's proof was his result that the set of lattice 
points in a simplicial cone of any fixed dimension can be expressed using a short 
generating function. In this section we give a new proof of this result for the special 
case of 2-dimensional cones (Theorem 16. A generalization of our proof to higher 
dimensions is not immediate, we hope, however, that such a generalization can be 
found in the future. 

We consider the Laurent polynomial ring K[xf , . . . ,x^]. For a vector m = 
(mi, . . . , md) £ Z d we write X . — X ^ ... X j This gives a bijection between 
l_ d and the set of monomials in K[xf , ...,x^\. We can thus represent the set of 
lattice points in a polyhedron P by the generating function fp(x) = J2mez d np x ™ ■ 
If P (~1 Z d is large, this representation of fp contains many terms. Using rational 
functions it is possible to find shorter representations of fp. For example the gen- 
erating function of all non-negative integral multiples of a vector m can be written 
as 1 J L xm , which allows us to express point sets like {0, m, . . . , km} as . 
Developing these notions in detail is beyond the scope of this article. As references 
we recommend jBR07j and |Bar08| . However, we would like to point out, informally, 
how the algebraic operations on generating functions correspond to geometric oper- 
ations: The sum of generating functions corresponds to the union of the respective 
sets. The product of generating functions corresponds to the Minkowski sum of the 
respective sets. Taking the product of a generating function and a monomial x m 
thus corresponds to translation by m. Evaluating the generating function fp(x) 
at the values x mi , . . . , x mi for mi, . . . , m^ £ Z d corresponds to applying the linear 
map given by the matrix A = (mi . . . md) that has the mm as columns to the set P. 
As we already mentioned, for every 2-dimensional rational cone K in R 2 there 
exists a lattice transform A such that AK = cone ((J) , (&)) for a, b S N with 
gcd(a, b) = 1. Barvinok showed a general version of the following theorem for cones 
of any dimension. We are going to give a new proof of this version for 2-dimensional 
cones (recall the definitions from the end of Section [5]) . 

Theorem 6.1. Let a,b <E N with gcd(a,6) = 1. Let K = cone (( \ ) , ( g )). Then 
Jk admits a representation as a rational function with C(log a) terms and this 
representation can be computed in time polynomial in log a + log b. 

Proof. Step 1. We express fx in terms of /a' • To this end we first note that 
cone((S),(g))= |J*(6) + ( cone ((o).(S))nRx[0 J &)) 

k>0 

and 

Z 2 ncone((i),(g))nRx [0,6) = {(° Q )}UT a ^ b 

U({(j) \i>a + l} + {{°) | < j < 6 - 1}) . 

See Figure 1111 In terms of generating functions this translates into 

Here we express fx using and a constant number of other terms. So it suffices 
to give a short expression of f&> . 

Step 2. We use the recursion from Lemma T3. 121 to give a short expression for f^i . 
Let {c n ) n be the sequence defined by c\ — 6, C2 = a and c;+2 = c,; mod Cj+i and 
let j be the index such that cy+i = 1 and cj + 2 = 0. We express /jv x c . m terms 
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Figure 11: Expressing the lattice points in a cone in terms of triangles. The right 
picture shows the occurring shapes (dashed lines indicate open faces). 



of / T , , by applying first Ell and then[5H2 and[3H4. 

y / \ r f Cj div c i+1 \ 

m t+ir \xi,x 2 ) = m t+1 , H+2 (xix 2 + ,x 2 ) 

+ t T c <c dive >c (^l)^) 



Ci+1 Ci+i 



c i + 2. c i + l 



/ —1 —1 -(Cjdiv C, + i)> 

(x 2 , x 1 x 2 ) 



'l _ c.+i+l -, _ Ci + i + 1 (ci div c i+ i)(ci + i + l) ' 



1-.t 2 ^ 1-an l-aix? dive4+1 / 

We have thus expressed /jv e using a constant number of other terms. We 
proceed in this fashion until we reach the case fx> = /t' which we can solve 
directly using [XT2l 3: 

h{ B . = xi- . 

5<ep 5. The expression is short and can be computed in polynomial time as the 
Euclidean Algorithm is fast. By Lemma [3.101 the number of iterations required in 
step 2 is O(log a). In each step we pick up a constant number of terms. So the total 
number of terms in the final expression is (D(loga). The algorithm runs in time 
polynomial in log a + log b as the numbers = <k mod a + i and Cj div Cj+i can 
be computed in time polynomial in logQ + logCj+i. □ 

This proof of Theorem 16.11 differs from Barvinok's. Barvinok gives a signed decom- 
position of a cone into unimodular cones. We give a positive decomposition of the 
triangle T' a b into triangles T' c div c +1 ) c +1 that are not unimodular but easy to 



describe, i.e. using a constant number of terms. 
In this context "positive" means that the 2-dimcnsional triangle T' a b is written 
as a disjoint union of half-open 2-dimensional triangles T' a b . This does not mean 
that the numerator of the rational function has only positive coefficients. Negative 
coefficients appear in the "easy" description of the triangles T' c ^ ^ div c +1 ) c +1 - 

Lemma 13.111 can be used to obtain a short representation of the generating func- 
tion of the lattice points in the fundamental parallelepiped of any rational cone 
in the plane. We implement this idea in the proof of Theorem 17.11 in Section [7] 
This representation can also be used to give an alternative proof of Theorem 16.11 
Again the representation is positive in the sense that the set of lattice points in 
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the fundamental parallelepiped is expressed as a disjoint union of Minkowski sums 
of intervals. But of course still negative coefficients appear as they appear in the 
representation of intervals. As opposed to the representation based on triangles, the 
representation based on fundamental parallelepipeds relics on taking products; so 
with this approach expanding the products in the numerators leads to an expression 
that is not short any more. 

It is also possible to give a recursion similar to 13 . 91 [3~TT1 and [3.121 directly for cones. 
However, in this case the recursion does require us to take differences of sets and we 
do not obtain a "positive" decomposition. Nonetheless the recursion differs from 
the one based on the continued fraction expansion of £ given in [Bar08[ Chapter 15]. 



7 Application: Dedekind-Carlitz Polynomials 

Given < a, b G N with gcd(a, b) = 1, Carlitz introduced the following polynomial 
generalization of Dcdekind sums, which Beck, Haasc and Matthews in |BHM08| call 

the Dedekind-Carlitz polynomial: 

a-1 

c a , b (x,y) :=Y^x k - x y^ k l 

k=l 

For a brief overview of the history of and literature about Dedekind sums and 
the Dedekind-Carlitz polynomial, we refer to |BHM08j . There also the relation- 
ship between Dedekind-Carlitz polynomials and the fundamental parallelepipeds of 
cones (see below) is established. Appealing to Barvinok's Theorem, Beck, Haasc 
and Matthews conclude that the Dedekind-Carlitz polynomial can be computed in 
polynomial time and must have a short representation^ however they do not give 
such a short representation explicitly. Also they remark that Dedekind sums can 
be computed efficiently in the style of the Euclidean Algorithm and ask if such a 
recursive procedure also exists for Dedekind-Carlitz polynomials. In this section we 
use the recursion for the lattice points inside a fundamental parallelepiped devel- 
oped in Section [3] to give an explicit recursion formula that allows one to compute 
short representations of Dedekind-Carlitz polynomials in the style of the Euclidean 
Algorithm. 

We first observe that c a ,b is the generating function of the set 

o) = n °> a > b ~ (o 

which is just a translate of the set of lattice points in the open fundamental pa- 
rallelepiped nj a b . Hence the recursion given in Lemma [3.111 can be used to give 
a recursion formula for Dedekind-Carlitz sums in the spirit of the Euclidean Al- 
gorithm. To this end, we use d ab (x,y) and d^ b (x,y) to denote the generating 
functions of the sets n? a b and a b , respectively. So 







b 




jzez 2 


Z\ = 




, 1 < zi < a 






a 



k=l 
5-1 



y k . 



k=l 



Now, by simply translating the geometric operations into the language of generating 
functions, we obtain the following theorem. In |BHM08j this result is derived from 
Barvinok's Theorem. We give an explicit recursion formula in the proof. 

3 In |BHM08l this is argued even for higher-dimensional Dedekind-Carlitz polynomials. 
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Theorem 7.1. Let < a, b <E tKl uwi/i gcd(a, b) = 1. T/ien c a ,b admits a repre- 
sentation as a rational function with O(loga) terms and this representation can be 
computed in time polynomial in log a + log b. 

As was said before, the representation we obtain is "positive" in the sense that we 
build a partition of the set IT J a b using Minkowski sums and disjoint unions of inter- 
vals. It is not positive in the sense that all coefficients appearing the representation 
are positive, as the representations of intervals that we use contain coefficients with 
opposite signs. 

It is important to stress that the representation we obtain makes heavy use of 
Minkowski sums of intervals. In the language of generating functions, this corre- 
sponds to taking products of expressions of the form ^Ett; for fc 6 N and ueZ 2 . 
Expanding the numerators of these products by applying the distributive law may 
lead to a numerator with a number of summands exponential in the number of 
factors of the product. So the expression we obtain is only short, if products are 
not expanded. We note that this problem does not occur with the representation 
we used in the proof of Theorem 16.11 

Proof. First we note that c a ,b(x,y) = x~ 1 d^ b (x, y). Now wc construct a short 

representation of d^ b (x,y) by applying Lemma 13.111 inductively. To that end let 
(c n )n be the sequence defined by c\ = b, c-i = a and = c% mod Cj+i and let j 
be the index such that Cj + \ = 1 and c.j + 2 = 0. Such a j exists because gcd(a, b) = 1. 
By Lemma 15.111 2 we can assume without loss of generality c\ > c 2 . Then for all 
i > 1 

4 i+UCi (x,y) = < +1 , Cl+2 (^ divc ' +1 ,?/) 

1 — ir^ Ci div c «+0 

+y -( Ci w^) d ^ +uCi+2t( xy c * divc >+\y) 
+x a y bV - y — 

i - y 1 

and 

^.ci+iO^y) ^VCflb'- 1 ) and d ii,c i+l ( x ^y) = xa y bd ^ +1 . Ct (-y^- x )- 

Together with 

dT r ■ , , (x, y) =0 and d\. . r .,,(x,y) = 

this gives us a recursion formula for b (x, y). In each step of the recursion we pick 
up only a constant number of terms and by Lemma [3.1 01 wc need only C(log a) steps, 
so the resulting representation has only O(loga) terms. As standard arithmetic 
operations can be computed in time polynomial in the input length, the algorithm 
runs in time polynomial in log a + log b. □ 

Note that by using products, one can give a representation of the lattice points 
in an interval of length n with O(logn) many terms and without using rational 
functions. Using such a representation, the above proof gives a representation with 
C(log 2 a) terms in time polynomial in log a + log b. Moreover this representation 
then is positive in that every coefficient appearing in this expression has a positive 
sign. 



C p .(i,j) = i <r ( xy CiAivCi +\y) 
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8 Application: Theorem of White 



To conclude this paper, we give a partly new proof for a theorem of White |Whi64[ 
pp. 390-394], characterizing lattices tetrahedra containing no lattice points but the 
vertices. Several proofs appeared over the years, e.g. by Noordzij |Noo81| . Scarf 
|Sca85j (based partly on work by R. Howe) and recently Reznick |Rez06j , who also 
gives an overview of the history of this theorem. Furthermore his proof has the 
advantage that it keeps track of the vertices, at the cost of geometric transparency. 
We construct our proof based on ideas in |Sca85j and mainly |Rez06j . 

For (a, b, n) £ Z 3 we define the tetrahedron T a .;,.„ as 




This should not be confused with the point set T a ^ defined in Section[3] As we will 
not use the latter any more, no ambiguities should arise. 

A "hidden" parameter, as Reznick writes, is c = n — a — b+1 (although he considers 
a slightly different c) . We will see that c plays a role equal to the ones of a and b 
in T a ^ >n . Also note that a + b + c = n + l. 

We call two tetrahedra T and T' equivalent (T m T"), if there is an affine lattice 
transformation which takes the vertices of T to the vertices of T'. 

A lattice simplex T is clean if there are no non- vertex lattice points on the boundary. 
If there are also no lattice points in the interior of T (i.e. the vertices are the only 
lattice points), then we call T empty. 

Theorem 8.1 (White). A lattice tetrahedron T is empty if and only if it is equiv- 
alent to To, o,i or to some Tx^ n , where gcd(d, n) = 1 and 1 < d < n — 1. 

Proof of Theorem \8.1\ (Necessity). As we easily see, Xo.o,i is empty. So we consider 
T\ t d,ni where gcd(d, n) = 1 and 1 < d < n — 1. Let w e Z 3 fl Ti t d, n - As the first 
coordinate of all vertices of T\ t d,n is either or 1, we know wi € {0, 1}. 



If w\ = 0, then w e conv | ( o J , ( l ^ j 



and w is a vertex. 

If wi = 1, then w £ conv|Q^ . As gcd(d, n) = 1 the vector (V^j is 

primitive and so w is again a vertex. 

Therefore T\^. n and any equivalent tetrahedron is empty. This proves that lattice 
tetrahedra of the form To.o.i or T\^. n with gcd(<i, n) = 1 and 1 < d < n — 1 arc 
necessarily empty. □ 

To show the sufficiency, we use the following theorem by Reeve. A nice proof can 
be found in |Rez06[ pp. 5-6]. 

Theorem 8.2 ( Rcc57 ). The lattice tetrahedron T is clean if and only ifT w ^0,0,1 
or T « Ta^n, where 

n > 2, 0<a, b < n — 1 and gcd(a, n) = gcd(6, n) = gcd(l — a — b, n) = 1. 

We will now prove the sufficiency to motivate the rest of the section, where we 
anticipate the results that are stated and proved only afterwards. 

Proof of Theorem \8.1\ (Sufficiency). If T is an empty lattice tetrahedron, then in 
particular it is clean and thus by Theorem [521 equivalent to To^o.i or some T a ^^ n . If 
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T ~ To. 0,1, we are done. Otherwise T Qj b,. 
fulfills the conditions for Lemma [ 



T is of course also empty and therefore 



This in turn enables us to use Lemma 18.51 which tells us that one of a, b, c equals 1. 
Finally, we can see by Lemma 18.31 that we can choose the order of the coordinates 
freely, so we get T ~ T\ t d, n , where gcd(d, n) = 1 and 1 < d < n — 1, again by 
Theorem [O □ 



It turns out to be useful to describe a clean tetrahedron T a ^ n in a slightly different 
way: 

Lemma 8.3. Let T a .h.„ be empty and < a, b < n — 1. Then 



Ta h ' 



conv 




and c > 1. 



In the next two proofs we follow mostly |Sca85( pp.41 If]. 
Proof. If n < a + b, then 

T 




where 



«4 



1 - -, 



o 2 



a 3 - a 4 



1 — — , and ai = 1 — Q!2 
But this means that < a\ < 1 and < a2,Q!3,Q;4 < 1, and thus 
contradicting the assumption that T a fi n is empty. 

So we know that n > a + b, which proves c > 1. The affinc lattice transformation 



a+b— n— 1 
n 

£ Ta,fa,n, 












• 


1 








-1 







gives us the desired form for T a ^, n - 



□ 



The key ingredient for the rest of the proof of Theorem [STT] is to look at the Beatty 
sequences for — , — and — simultaneously. For this purpose let us define the sum of 
the sequences, i.e. 

f{k) := S„,a(fc) + B„, b (fc)+B„, c (fc) 



a -k 



n 



°-k 



a -{k-D 
n 



-(fc-1) 
n 



n 



This function has a strong connection to Theorem 18. li as we sec next. 

Lemma 8.4. IfT a j,. n is empty, then f(k) = 1 for k = 2, .. . ,n — 1. 

Proof. Let us first define the function g(k) := \—k\ + \—k\ + \—k \ . An easy 
computation verifies that for a, b, c relatively prime to n and 2 < k < n — 1, we have 
f(k) = g(k) - 3 - (g(k - 1) - 3) = g(k) - g(k - 1). 

We will now show that g(k) = k + 2 for k = 1, . . . , n — 1 and also that a,b,c are 
relatively prime to n. 



Suppose that g{k) < k + 1 for some k. Then we can define a lattice point 
P> 



with 







\ b -k 








, q> 




, r> 


r £ *i 


n 




n 




n 
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and p + q + r = k+ l. But then we find ol\ , 



, 0:4 with 



'p\ /10 0a 
q = 1 6 
t/ \0 1 c 



) 






\0LiJ 



where 



atx=p-%k, a 2 = q-^k, a 3 = r-%k, a 4 = ifc and 

4 



a b c k 

p + q + r k k AH 

n n n n 



k + 1- 
1. 



(a + b + c- l)k 



k + l-k 



As this means that (ji^J is in conv j^o^ '(^)'(j)'( b )} ~ -^Am we have a 
contradiction. 

So we know that g(k) > k + 2. 

If one of a,b,c is not relatively prime to n, say it is c, then ^fc £ Z for some 
fc e {1, . . . , n — 1}. So we get 



g(k)+g(n-k) = 



and by the assumption on c this is at most a + & + c+ 2 = n + 3. But then cither 
.o(fc) < fc + 1 or g(n — A;) < n — k + 1, which cannot be true for an empty T a ^^ n . 

If they are all relatively prime to n, then g(k) + g(n — k) = a + o + c + 3 = n + 4. 
Together with g(k) > k + 2 and g(n — k) > n — fc + 2 we get the desired equality. 

We have now seen that g(k) = k + 2 for fc = 1, . . . , n — 1. Together with /(fc) = 



r £ *i 


+ 




+ 




+ 


a — 




+ 


b- 


b -k 
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c — 


-A 


n 




n 




n 






n 






n 
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r £ *i 
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'h' 


+ 




+ a- 


«k 


+ b- 


b -k 


+ c- 


°-k 


n 




n 




n 






-XI - 






n 






-71 - 



g{k) — g(k — 1) this implies /(fc) = 1 for fc = 2, 
This is all we need to finish the proof: 



. , n 

,n- 



□ 



Lemma 8.5. If /(fc) = 1 for k = 2, . . . , n — 1, then at least one of a,b or c equals 
1. 

This is the new part of the proof. It builds onto the observations about Sturmian 
sequences developed in the first half of this article. In particular we make use of 
the fact that Sturmian sequences are block balanced, and, more generally, that the 



Is in a Sturmian sequence are evenly distributed. See Theorem 14. 1 Viii) 



Proof. Suppose not. Without loss of generality c < a, b. We consider the intervals 

B n> a\[i,n], B n,b\[l,n] and B n,c\[l,n]- As c > 2, there is another 1 in B n ,c|[i,n] a P art 
from _B„. c (n) = 1. Let m be the position of the 1 preceding B„ iC (n) = 1, i.e. 
1 < m < n such that B n ^ c (m) = 1 and B n ^ c {k) = for m < k < n. Because 
/(fc) = 1 for all 2 < fc < n — 1, we know that B n ^ a (m) = B n ^(m) = 0. 
At this point we make a table listing the values of the three intervals at the positions 
1, . . . , n, filling in the values that we know and marking the values that we have not 
yet determined with *. 
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1 2 ... 777 — 1 777 777+1 



77—1 77 



B n ,a\[l,n] = * ... * * ... * 1 
B n ,b\[l,n] = * ... * * ... * 1 

B„,c|[l,n]= * ... * 1 ... 1 

Because f(n — 1) = 1 we know that either B_ n a (n — 1) = 1 or B_ n b (n — 1) 
1 and we may assume it is B_ n a (n — 1) = 1. We are now going to apply the 



following argument over and over again. By Theorem [¥j we know that if we 
find an interval of length I in a Sturmian sequence that contains t Is, then any other 
interval of length / in the same sequence has to contain at least t — 1 and at most 
t + 1 Is. In this case ones(_B„. a |[„_ 1 „]) = 2 and so both ones(-B„ ja |[ TO _ l!m ]) > 1 
and ones('3„. a |[„ i m+1 j) > 1, which means B n ^ a {m — 1) = B nM (m + 1) = 1 and 
consequently B n ^(m — 1) = B n j,(m + 1) = 0. Now our table looks as follows. 

1 2 ... 777—1 777 777+1 ... 77—1 77 

B n , a \ [1<n] = * ...* 1 1 *...* 1 1 
B ntb \ [1>n] = 0*...* *...* 1 

B n ,c\[l,n]= 0*...* 1 ... 1 

But now ones(i? nj ( ) |[ m _ 1 m+1 j) = and so B n ^(n — 2) = and B nia (n — 2) = 1. 
This gives ones(-B„, a |[„_2,„]) = 3 which allows us to deduce B n ^ a (m + 2) = 1 and 
B n ,b(Tn + 2) = 0. Then we have ones(i?„ j 6|[ m _ ljm+ 2]) = and so B n ^{n — 3) = 
and B n a (n — 3) = 1, which gives ones(/5 nia |[„_ 3j „]) = 4 and so B n ^ a (m + 3) = 1 
and B n ^(m + 3) = 0. Wc continue this argument inductively until we have shown 
that B nfi (k) = for 777 + 1 < k < 11 - 1. 

At this point both B n jj |[ m _i,„_i] and -Bn,c|[m+i,n-i] are intervals of consecutive 
0s, of length ?7 — 777, + 1 and n — m — 1, respectively, where the latter is maximal. 
So the blocks of B n ^ are strictly larger than the blocks of B n c . As the Is in 
Sturmian sequences are evenly distributed, this implies c = t(B n c ) > !(£>„, b) = 6, 
in contradiction to our assumption. □ 
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